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1 

Diagnostic and screening method 
Field of the invention 

The present invention relates to a novel diagnostic method for the 
5 detection of infertility in males. In particular, the present invention relates to a 
diagnostic method for detecting the presence or absence of a mutation or 
mutations in the POLG gene in a biological sample. The invention relates also 
to the use of a mutant POLG gene in the detection of infertility in males and in 
the screening of human populations for the presence of such mutation or 

10 mutations as a predictive test for male infertility. The present invention further 
relates to the use of the POLG gene as an indicator of other pathological 

k conditions associated with or related to male infertility, including those 
manifesting in women. 

Background of the invention 

15 Fertility problems have manifested increasingly in western countries 

due to a variety of social causes. For example, young couples delay their 
decision to establish a family for reasons of education and career, and tend to 
plan for smaller family size due to the increasing priority placed upon quality of 
life. Environmental and lifestyle factors may also play a part in revealing an 

20 underlying sub-fertility due primarily to other causes. For many couples, 
prolonged attempts to become pregnant end in failure, resulting in their 
seeking assistance from an infertility clinic. This decision is often taken long 
past the time when they initially would have hoped to start a family. People are 
understandably reluctant to take a step that may seem humiliating. Such 

25 delays, together with the attendant stress, anguish and shame, can have a 
profoundly destructive effect on relationships and on life quality generally. 

. Infertility can result from a great variety of causes, including 
anatomical, developmental, infectious and toxicological factors. The majority of 
cases can be attributed wholly or predominantly to one or other partner, with 

30 roughly equal numbers of cases of male and female factor infertility. In both 
instances, however, the primary cause is almost certainly genetic. Estimates of 
the true population prevalence of infertility vary, but are generally accepted to 
be in the range of 2-5%. This is therefore one of the commonest human 
genetic disorders. 

35 Male infertility can manifest as reduced quantity (oligozoospermia) 

or total lack (azoospermia) of sperm, reduced quantity of motile sperm 



2 



(asthenozoospermia) or morphologically abnormal sperm (teratozoospermia). 
These categories are not exclusive, since many infertile males appear to fall 
into more than one such class. Microscopic, cytological and biochemical 
methods are presently available for the determination of these defects. 
5 However, actual measurements can be highly variable, even for a single 
individual between samplings. Frequently, moreover, no actual cause for male 
infertility can be found. This can cause additional costs arising from the need 
for further clinical examinations. Selecting an appropriate method of assisted 
reproduction is often a haphazard and costly matter, with no guarantee of 

10 eventual success. All these difficulties and delays can place severe, additional 
strains on relationships. Additional means for the early detection or prediction 
K of infertility, in particular male infertility are thus needed. Early detection of 
male infertility would save both money and human stress and would allow 
appropriate counselling or assisted reproduction to be offered to a significant 

1 5 population of individuals who would otherwise only discover their problem after 
it has caused severe damage to their quality of life. 

Though it has been long suspected that male infertility in most 
cases is a genetic disorder, and furthermore that it is genetically hetero- 
geneous (i.e. that many different genes are the underlying cause of the 

20 disorder in different individuals), little information has emerged relating specific 
genetic defects and male infertility. The two known genetic causes of male 
infertility are the 'cystic fibrosis' mutations in the CFTR gene, part of a clearly 
recognizable clinical syndrome affecting approximately one person in 1500 
[see, e.g. Lissens, W. and Liebaers, I. Baillieres Clinical Obstetrics and 

25 Gynaecology 11 (1997) 797-817; Schnedl, W. et a/., Wiener Klinische 
Wochenschrift 103 (1991) 29-33] and a deletion on the Y chromosome, that 
leads to complete azoospermia, and is found in approximately 1-2% of infertile 
males in this category {Elliott, DJ. and Cooke, H.J. BioEssays 19 (1997) 801- 
809]. The supposed genetic cause of male infertility in the vast majority of 

30 cases has thus far remained unknown. 

Since spermatozoa are heavily dependent on respiratory energy for 
motility, impaired energy metabolism, whether in mature spermatozoa or at 
earlier stages of male germ cell differentiation, is a long hypothesized 
mechanism contributing to infertility. Defects in mitochondrial function [Johns, 

35 J.C.S. et a/., Nat Med 3 (1997) 124-125], possibly associated with mtDNA 
lesions [Kao, S.K et a/., Mol Hum Reproduction 4 (1998) 657-666; Lestienne 
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P. et a/., Mol Hum Reproduction 3 (1997) 811-814], have been reported in 
sperm samples from infertile males, and sperm motility appears to be 
correlated with mitochondrial respiratory activity [Ruiz Pesini, E. et a/., Clin 
Chem 44 (1998] 1616-1620] and membrane potential [Troiano. L. et a/., Exp 
5 Cell Res 241 (1998) 384-393]. However, no relationship between a specific 
mitochondrial gene and male infertility has been disclosed or even suggested. 
Short description of the invention 

It has now, surprisingly, been discovered that the frequency of a 
mutant genotype in the POLG gene encoding the catalytic subunit of 

10 mitochondrial DNA polymerase (DNA polymerase y) is significantly increased 
in groups of infertile males. Specifically, the mutant genotype has been located 
in the area of the trinucleotide (CAG) microsatellite repeat of the POLG gene 
within the N-terminal region of the coding sequence. While the normal POLG 
gene contains 10 consecutive, glutamine encoding CAG codons, followed by a 

15 single CAA and two further CAGs, at least one allele with an altered CAG 
repeat-length in the POLG gene is found in a large population of infertile men. 

The POLG mutations described in the present application represent 
the first significant step in efforts of elucidating the genetic nature of male 
infertility and provide novel means of diagnosis thereof. 

20 An object of the invention is thus to provide a non-invasive, non- 

intrusive diagnostic method that is useful in identifying, detecting and 
characterizing male infertility in the large fraction of cases where its causes 
remain unsolved. 

A second object is to provide a simple screening test of strong 

25 predictive value, in order to identify in advance those individuals who will 
require reproductive counselling and assistance. Such information will greatly 
enhance the expected quality of life of those who suffer from this disorder. 

The present invention relates to a new method for the diagnosis of 
male infertility by detecting the presence or absence of a mutation or 

30 mutations in the POLG gene encoding the catalytic subunit of mitochondrial 
DNA polymerase in a biological sample. 

The present invention also relates to the use of a mutant form of the 
POLG gene encoding the catalytic subunit of mitochondrial DNA polymerase 
for the diagnosis of male infertility. 
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The present invention also relates to the use of a mutant form of the 
POLG gene encoding the catalytic subunit of mitochondrial DNA polymerase 
as a diagnostic agent. 

The present invention further relates to diagnostic kits containing 
5 suitable reagents to detect a mutant form of the POLG gene or the normal 
POLG gene. 

The present invention further relates to the use of the POLG gene 
as an indicator of other pathological conditions associated with or related to 
male infertility, including those manifesting in women. 
1 0 Description of the drawings 

Figure 1 illustrates the genotype data of Table 2 as a bar-chart. 

Detailed description of the invention 

Microsatellite sequences (short sequences consisting of 
consecutive runs of a simple repeated element, such as a di- or trinucleotide) 

15 are well established sources of genetic polymorphism, exhibiting significant 
levels of length variation in the many genetic loci where they are found. This 
reflects the fact that they are inherently less stable than other sequences 
during, for example, DNA replication or repair, where slippage mis-pairing 
events may increase or decrease the number of re-iterated units. Many such 

20 sequences are found within the human genome, and are usually located 
outside of coding regions, where they have little consequence for phenotype. 
In some instances, however, they are located within or adjacent to coding 
DNA. In these cases, large expansions of the number of repeat units can have 
a profound effect, leading to loss or gain of genetic function and a resulting 

25 pathological phenotype. Instability at trinucleotide repeats is already known to 
be associated with various human disorders, including, for example, 
Huntington's disease (HD), myotonic dystrophy (DM), and several forms of 
spinocerebellar atrophy (e.g. SCAI). Other, more subtle variation at tri- 
nucleotide microsatellites can also affect phenotype, and the association 

30 described herein, between male infertility and variation in the CAG 
trinucleotide tract of the POLG gene, encoding the catalytic subunit of 
mitochondrial DNA polymerase, falls into such a category. 

Rovio, A. et a/, studied the frequency of different repeat-length 
alleles of the POLG gene in control populations, as well as in groups of 

35 patients suffering from recognized mitochondrial disorders (Eur J Hum Genet, 
1999 in press). The predominant allele of 10 CAG repeats was found at a very 
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similar frequency (approximately 88%) in both Finnish and ethnically mixed 
population samples, with homozygosity close to the equilibrium prediction of 
80%. Other alleles of between 5 and 13 repeat units were detected, but no 
larger, expanded alleles were found. Patients with a variety of molecular 
5 lesions in mtDNA showed no differences in POLG trinucleotide repeat-length 
distribution from controls. The authors concluded that their findings rule out 
POLG repeat expansion as a common pathogenic mechanism in disorders 
characterized by mitochondrial genome instability. 

The present invention is based on studies that were designed to 

10 test a hypothesis that this gene, encoding a major component of the 
machinery of mitochondrial DNA maintenance, is involved in infertility, via 
^ impaired energy metabolism of spermatozoa. The POLG gene encodes a 
protein of approximately 130 kDa and has been mapped to the region of 
chromosome 15q24-15q26 [Walker, R.L. et a/., Genomics 40 (1997) 376-378; 

15 Ropp, P.A. and Copeland, W.C., Genomics 36 (1996) 449^58; Zullo, S.J. et 
a/., Cytogenet Cell Genet 78 (1997) 281-284]. 

For PCR analysis, nested primers flanking the POLG CAG repeat 
were designed, based on a region of the coding sequence of POLG (see 
sequence id. no. 1). The primers amplified the predicted fragments from 

20 genomic DNA, as verified by direct sequencing. Testing against the mono- 
chromosomal interspecies hybrid and Genebridge 4 radiation hybrid panels 
[Gyapay, G. et al, Hum Mol Genet 5 (1996) 339-346] confirmed that they 
detected only the expected gene sequence from chromosome 15q, and not a 
pseudogene. The 5 1 primer (also designated mip51) was also found to 

25 generate the predicted products, using other, more downstream 3* primers 
from the same exon of the POLG gene. Using these primers and sperm DNA 
from individuals in whom azoospermia was excluded, the PCR analysis 
showed that 9 out of 99 male infertiles were homozygous for absence of the 
normal, 10 repeat-unit allele. Thus approximately 10% of infertile males, 

30 whose infertility is not clearly attributable to known causes, and excluding 
cases of azoospermia, completely lacked the common POLG allele, having 10 
consecutive CAG codons. Instead they possessed two alleles with a different 
number of CAG repeats. No instances of this genotype were found in sperm 
DNA from 98 fertile males. The association is highly significant based on two 

35 different statistical tests (the z-test for percentages, as well as Poisson 
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probabilities). The homozygous mutant genotype was found in only 2 of over 
250 healthy controls collected without regard to gender or fertility. 

Male infertiles also showed a higher frequency of heterozygosity 
than found amongst fertile males or in the general population. Some 35% of 
5 infertile males, compared with only about 20% in the fertile or unselected 
population, were heterozygous at the POLG CAG repeat, i.e. possessed one 
copy of the wild-type or normal allele with 10 repeats, and one other copy. 
These figures strongly indicate that many of the 35% of infertiles who are 
heterozygotes are, in fact, compound heterozygotes, i.e. that the allele of the 

10 POLG gene that is apparently wild-type or normal at the CAG repeat in fact 
carries a mutation elsewhere in the gene, such that both copies are defective, 
at least as regards the function that POLG performs in spermatogenesis that is 
required for fertility. 

PCR studies of POLG genotype frequencies in blood DNA from 

15 various categories of patients indicated that a mutation at this locus is not 
associated with azoospermia or with severe oligozoospermia, but is found in 
individuals with a range of sperm quality defects. 

Also fluorescent PCR using custom primers was used to genotype 
all individuals in the study for POLG CAG repeat length, as summarized in 

20 Table 2. Semen DNA from a minority (approx. 15%) of male infertiles yielded 
no clear signal and these were excluded from consideration. Approximately 
10%o of individuals lacked the normal allele. By contrast, this normal allele was 
present in all 98 fertile males studied. The normal allele was also absent in 
only one out of 1 18 healthy male controls, which had not been selected on the 

25 basis of fertility, in addition to the 250 controls studied previously without 
regard to gender or fertility. 

Given that the combined frequencies of mutant alleles (i.e. those 
having any number of repeats other than 10) revealed in the population survey 
of Rovio, A. et al. (supra) was 0.12, the homozygous mutant genotype is 

30 expected to be found in approximately 1.5% of randomly selected individuals, 
based on standard Hardy-Weinberg predictions (see Table 2). The above data 
indicate that it was found in the general population slightly below expectation 
(0.8%, i.e. 2 out of 252), although this deviation is not statistically significant. 
By contrast, finding the homozygous mutant genotype by chance in 9 out of 99 

35 individuals is exceedingly unlikely, assuming that the previously measured 
allele frequencies are representative. The association is significant (p < 0.01 ) 
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based on a z-test for percentages (comparing fertile and infertile groups), or 
using Poisson statistics (assuming a mean population frequency of the 
homozygous mutant genotype of 1 .5%). 

Many different alleles and combinations thereof were found in 
5 individuals lacking the common allele (Table 1). This indicates that it is 
absence of the normal allele, rather than the presence of a particular alternate 
allele, that is associated with the phenotype. All alleles found in interfiles were 
also found in controls. The frequencies, relative to one another, of the various 
mutant alleles found amongst interfiles were also similar to those found in the 

10 general population. The different profiles of POLG genotypes found in sperm 
DNA from fertile and infertile males, compared with controls, is illustrated in 

k Figure 1. 35 out of 99 (i.e. 35%) of interfiles were heterozygotes, carrying just 
one copy of the normal allele. 

This is well above expectation (21%), based on the measured allele 

1 5 frequencies in the overall population, or compared with the fertile control group 
(18%). Importantly, some of these could represent compound heterozygotes, 
with a second mutation mapping elsewhere in the gene than within the CAG 
repeat of the coding region of the gene. 

In order to establish the type of infertility associated with POLG 

20 genotype, blood DNA was studied from two larger groups of patients, who 
included a high proportion of azoospermic individuals and those with severe 
oligozoospermia (sperm concentration < 5 x 106 /ml), as well as individuals 
with only a mild abnormality of sperm concentration, motility and/or 
morphology. Sperm DNA was also analyzed from a further set of patients, for 

25 whom extensive phenotypic data were available, and which consisted of cases 
of 'pure asthenozoospermia', with sperm concentrations well into the normal 
range (generally > 100 x 10 6 /ml). 

Cases lacking clear results of sperm analysis were excluded, as 
were those where clinical or karyotypic examination clearly indicated an 

30 established hormonal, developmental or traumatic etiology of their infertility. 
The azoospermic and severe oligozoospermic patients were considered 
separately from those having sperm concentrations of 5 x 10 6 /ml or greater. 

Out of 62 azoospermic and 73 severe oligozoospermic individuals 
no cases of homozygous absence of the normal allele were detected. The 

35 frequency of heterozygotes in these groups (30%) was, however, slightly 
above the population average. By contrast, 8 out of the 113 with infertility of 
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unknown etiology, and having measured sperm concentrations of 5 x 10 6 /ml or 
greater, lacked the normal allele. 

Individuals homozygous for the absence of the normal allele were 
found in every category of sperm quality defect (sperm number, motility or 
5 morphology, or any combination). Most, however, were at or below at least two 
of the thresholds commonly used to define oligozoospermia (20 x 10 6 
sperm/ml), asthenozoospermia (50% motile sperm) and teratozoospermia 
(10% morphologically normal spermatozooa). However, no mutant genotype 
was found in semen DNAs from the 56 patients with 'pure astheno- 

10 zoospermia'. It is concluded that the mutant POLG genotype is associated with 
a general impairment in sperm quality and/or number, but neither with the 
y complete or almost complete absence of spermatozoa, nor with a motility 
defect as such. The simplest interpretation is that cell division or differentiation 
is impaired, leading to a low production of normal spermatozoa. 

15 The above data clearly indicate that POLG alleles other than the 

normal one of 10 repeat units are deleterious to sperm function or differ- 
entiation, but have little or no effect on other tissues. Polyglutamine tracts are 
commonly regarded as interfaces for protein-protein interactions, hence it is 
possible that a sperm-specific protein interacts with this region of the poly- 

20 peptide. Other possibilities are that repeat-length variants are inherently 
unstable, poorly targeted to mitochondria, or enzymatically defective, all of 
which are testable. The many rounds of cell division during spermatogenesis, 
plus the necessity, for the creation of functional spermatozoa, of maintaining a 
genetically fit mitochondrial genome, argue that a sub-optimal mtDNA 

25 polymerase, whether by virtue of impaired fidelity, processivity, or some other 
defect, could result in the accumulation of mtDNA mutations and failure to 
complete differentiation. 

Male infertility is certainly a genetically heterogeneous disorder 
[Sokol, R.Z., Curr Opinion Obst Gynaecol 7 (1995) 177-181]. The data of the 

30 present invention indicate that where there is no other obvious etiology, 
approximately 10%, and perhaps as many as 20% of cases are associated 
with a mutant genotype at the POLG locus, excluding those where sperm 
count is negligible. 

The measured frequency of mutant alleles in the population 

35 (approx. 0.12) means that approximately 1.5% of all males will lack the normal 
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allele, and are predicted to be infertile or at least subfertile, based on these 
findings. 

According to the diagnostic method of the present invention, the 
presence or absence of a mutation or mutations in the POLG gene can be 
5 detected from a biological sample by any known detection method suitable for 
the purpose. Thus, the presence or absence of a mutation or mutations in the 
POLG gene can be detected from a biological sample by any known method 
for detecting gene copy number or expression, i.e. methods based on 
detecting the copy number of the gene (or DNA) and/or those based on 

10 detecting the gene expression products (mRNA or protein). Such methods are 
easily recognized by those skilled in the art and include the use of the 
^ polymerase chain reaction (PCR) or other thermal cycler-based DNA synthetic 
techniques in the presence of appropriate oligonucleotide primers specific for 
the POLG gene, molecular cloning in a plasmid or other suitable vector, 

15 detection of length variants in a DNA sample by agarose or polyacrylamide gel 
electrophoresis or any analogous technique, gel or capillary electrophoresis 
with automated detection and analysis of products tagged with a fluorescent or 
other label incorporated into the DNA, DNA sequence determination by any 
technique, heteroduplex-based and associated methods for detecting base 

20 mismatches or length variants, mass spectroscopy, chromatographic and 
other separation methods, Western analyses, immunohistochemistry, and 
other immunoassays, and any technique suitable for the detection and 
functional characterization of nucleic acid and protein which may be applied in 
mutational analysis. Preferable methods are those suitable for use in routine 

25 clinical laboratories, or which may eventually be developed in self-test kits. 

In the diagnostic method of the invention, the biological sample can 
be any suitable tissue sample or body fluid, such as buccal smear 
(mouthwash), hair roots, semen, seminal plasma, whole blood, serum or 
plasma, or cultured cells derived from any biopsy. The biological sample can 

30 be, if necessary, pretreated in a suitable manner known to those skilled in the 
art, for the purposes of extraction of DNA in a form suitable for analysis. 

The diagnostic kit of the present invention comprises reagents 
necessary for the detection of a mutation or mutations in the POLG gene. 
These reagents include specific antibodies, preferably monoclonal antibodies, 

35 capable of identifying the POLG gene or its gene products, other antibodies, 
primers, markers and/or standards that are needed for visualization or 
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quantification as well as buffers, diluents, washing solutions and like, 
commonly contained in a commercial reagent kit. Alternatively, the diagnostic 
kit of the present invention may comprise portions of the POLG gene or its 
functional variant or fragment, or other nucleic acid-derivative related to it, 
5 together with suitable reagents, such as those listed above, needed for the 
detection of the mutation or mutations in the POLG gene. 

The present invention provides an additional means for detecting 
and analyzing male infertility, and a means for predicting it, when carried out 
on adults, children, in infancy, in utero or during embryonic development. 

10 The invention is now elucidated by the following non-limiting 

examples. The patients and controls used in the first example are as follows. 
Volunteers for the study, as authorized by the local ethical committee, were 
recruited from amongst male infertiles attending an infertility clinic in Oxford. 
Semen samples obtained by masturbation were stored frozen in liquid nitrogen 

15 for further study. Azoospermic individuals (approx. 20% of the total) were 
excluded, as were any for whom a clear, established etiology (hormonal, 
developmental, karyotypic or traumatic) could be found to account for their 
infertility. Semen was also collected from sperm donors, for use as fertile 
controls in the study. Semen or seminal plasma was also provided from 

20 similarly defined infertile males in Tampere, Finland and from infertiles in 
Australia and Taiwan. Blood samples were provided from fertile and infertile 
males, as well as from unselected control males in Dusseldorf, Germany and 
from male infertiles in Edinburgh, Scotland. Blood samples were also provided 
by otherwise healthy volunteers amongst laboratory personnel and associates, 

25 who were not screened for fertility. 
Example 1 

Genotyping at the POLG CAG repeat as a diagnostic test for 
male Infertility 

DNA was prepared from fresh or frozen sperm or seminal plasma 
30 as follows. 400 samples were micro-centrifuged at 13,000 rpm for 5 min and 
the pellets washed twice with phosphate buffered saline, pH 7.4 (PBS). Crude 
DNA was then extracted from the washed pellets by incubating with proteinase 
K as described by Reid, F.M. et at. [Hum Mutation 3 (1994) 243-247], followed 
by inactivation of the enzyme at 92°C for 10 minutes. Blood DNA was isolated 
35 by the same method, as described by Reid, F.M. et a/., supra. DNA can be 
obtained from other biological samples using essentially similar methods which 
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are well known to those skilled in the art. Prior to all PCR analyses, the DNA 
extracts were microcentrifuged for 15 sec and 0.5 jjlI of clear lysate used as 
template. In any cases where signals were low or uninterpretable, PCR was 
repeated using 0.5 \x\ of the template DNA solution diluted to various extents in 
5 order to reduce the concentration of contaminants that may inhibit DNA 
synthesis. More than 0.5 (al of template DNA solution may also be used, if 
sample concentration is judged to be too low. 

Fluorescent PCR analysis of POLG genotype was carried out as 
follows. Nested oligonucleotides corresponding to regions of the N-terminal 

10 coding sequence of POLG, as indicated in the accompanying list of 
sequences, were purchased from Life Technologies (Paisley, Scotland) and 

K DNA Technology (Aarhus, Denmark). One of the primers, mip51 (5' 
CCAGCTCCGTCCCCGCGTCCGACC 3'; sequence id. no. 2), was 5' 
prelabeled with the fluorescent dye ROX (PerkinElmer) by the manufacturer, 

1 5 but other dyes of similar application can also be used 

Fluorescent PCR reactions used, at 0.2 mM each, ROX-labeled 
primer mip51 plus one downstream primer, unlabelled, either mip31 (5' 
GCTGCCCGCCCTCCGAGGATAGCAC 3'; sequence id. no. 3; generating a 
126 bp PCR product from the wild-type allele, when used in combination with 

20 primer mip51), mip32 (5'CTCTCGAGAGCATCTGGATGTCCAATC 3'; 
sequence id. no. 4; generating a 165 bp PCR product from the wild-type allele, 
when used in combination with primer mip51) or mip33 
(5'CTCGTGCAGCCCTCTCGAGAGCAT 3'; sequence id. no. 5; generating a 
176 bp PCR product from the wild-type allele, when used in combination with 

25 primer mip51 from the wild type allele). Reactions were carried out in 12.5 jal at 
200 [iM dNTPs (Pharmacia Biotech, Sweden), plus an appropriate quantity of 
a suitable thermostable DNA polymerase (0.15 units of Dynazyme, from 
Finnzymes, Espoo, Finland) in the manufacturer's buffer. Reactions comprised 
30 cycles of denaturation for 1 min at 95°C, annealing for 45 sec at 62°C, and 

30 extension for 1 min at 72°C (5 min extension in final cycle). Products were 
diluted 1:10 in water, and samples containing 1 \±\ of diluted PCR product, 12 
^l of deionized formamide and labelled DNA fragment size markers in the 
appropriate size range were added [e.g. 0.15 [i\ of Tamra Genescan350 DNA 
size standards (Perkin Elmer)], and samples were analyzed by capillary 

35 electrophoresis on a suitable electrophoresis/fluorescent analysis instrument, 
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e.g. ABI 310 Genetic Analyzer (PerkinElmer), using the manufacturer's data 
collection and analysis software. 

Samples giving ambiguous or low signals, or those giving a result 
indicative of a mutant genotype, were re-evaluated using a second down- 
5 stream primer. In all analyses, a reaction using a template DNA sample 
previously demonstrated to contain only the wild-type POLG allele, as well as 
a reaction run without any added DNA, were run alongside as positive and 
negative controls, respectively. 

The results showing the mutant POLG alleles of male infertiles are 
10 shown in Table 1. 

Table 1 

Mutant POLG alleles of infertile males 



POLG repeat-length genotype 


Number 


11/11 


8 


11/12 


5 


11/9 


3 


9/12 


2 


11/13 




9/9 




8/11 




8/12 




6/12 




Total 


23 



15 

The PCR amplification and fluorescent analysis described enabled 
samples to be categorized as follows: 

Class I - WILD-TYPE (or normal) HOMOZYGOTES: only a single 
allele was detected, as a clean peak of a size consistent with the presence of 
20 10 consecutive copies of the CAG repeat. 
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Class li - MUTANT 'HOMOZYGOTES': one or two alleles were 
detected, both corresponding with length variants other than the wild-type 
allele of 10 CAG repeats. These individual are compound heterozygotes at the 
POLG CAG repeat However, since such individuals are homozygous for loss 
5 of the wild-type allele, they are placed in a different category from 'true 1 
heterozygotes as defined below (Class III). 

Class III - HETEROZYGOTES: two length alleles were detected, 
one corresponding with the wild-type allele (10 CAG repeats) and the other a 
different number. 

10 Data for such an analysis are illustrated in Figure 1 and Tables 2 

and 3, below. Controls were the combined sets collected previously from 

K Finland, being a mixture of Finns plus other ethnic groups of both sexes, plus 
those from Germany (males only, no selection based on fertility). Infertiles and 
fertiles indicated in Table 2 each represent pooled groups of individuals from 

15 Finland and England. 

Table 2 

POLG genotype frequencies detected in various fertile and infertile 
males (sperm), plus unselected controls (blood) 



Patients 


Class I: wild-type 


Class II: mutant 


Class III: 


(COUNTRY) 


homozygotes 


'homozygotes' 


heterozygotes 


Infertile males 


55 


9 


35 


(FIN/ENG) 


(56%) 


(9%) 


(35%) 


Fertile Males 


80 


0 


18 


(FIN/ENG) 


(82%) 


(0%) 


(18%) 


Total controls 


195 


2 


55 


(FIN/ENG/D) 


(77%) 


(1%) 


(22%) 



14 

Table 3 

Frequency of 'mutant homozygotes' (Class II) in various 
groups of infertile males 



Patients 

(COUNTRY, DNA SOURCE) 


Total 
analysed 


Class II: mutant 
'homozygotes' 


Azoospermia or severe oligozoospermia 
(sperm concentration < 5x 10 6 /ml) 

(D/SCO, blood) 


135 


0 

(0%) 


Pure asthenozoospermia (< 50% motile, 
normal sperm concentrations) 

(Taiwan, sperm) 


55 


0 

(0%) 


Combined sperm quality defect (2 or 
more of moderate oligozoospermia, 
asthenozoospermia, and 
teratozoospermia) 

(D, blood) 


113 


8 

(7%) 
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The analysis resolved the samples into the following groups in a 
clearly predictive manner, as follows. 

(1) Genotype frequencies in the control group were very close to 
the equilibrium predictions based on overall population-based measurements 

10 of allele frequencies of 0.88 (wild-type, 10 CAG repeats) and 0.12 (all mutant 
alleles combined). 

(2) No instances of 'mutant homozygotes' (Class II) were detected 
amongst fertile males. 

(3) Approximately 9% of infertile males, excluding cases of 
15 azoospermia and severe oligospermia, fell into Class II. 

(4) Heterozygotes (Class III) were found in all groups, but at a 
higher frequency in infertile than fertile males or controls. 

The diagnostic and counselling implications of the test for POLG 
genotype that forms the subject of the present invention, and as proposed to 
20 be carried out, for example, in an infertility clinic, are as follows: 

(1) A result falling into Class II is an indicator of a specific type of 
male factor infertility, associated with POLG dysfunction. This genotype is 
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never found in fertile males. Other clinical or biochemical investigations of both 
partners can essentially be dispensed with as unnecessary. Even in the 
absence of sperm quality data, it indicates that intra-uterine insemination and 
probably IVF are likely to fail, and that recourse to ICSI (intra-cytoplasmic 
5 sperm injection) is advisable. 

(2) A result falling into Class I does not exclude male factor infertility 
of other types, nor any other diagnosis. 

(3) A result falling into Class III is ambiguous, and warrants further 
genetic investigation of the POLG locus, as illustrated below (example 2). 

10 Example 2 

^ Complete analysis of the POLG gene as a diagnostic test for 

male infertility 

Results from the type of analysis illustrated in Example 1 are 
necessarily ambiguous, in the case of individuals who prove to be 

15 heterozygous at the POLG gene CAG repeat (i.e. who possess one copy of 
the wild-type, 10-repeat allele, plus one copy of some other length variant). In 
these cases, further investigation is warranted, in order to establish whether 
they represent cases of true heterozygotes (one fully functional, one mutant 
copy of the gene), as found amongst fertile males and controls, or whether 

20 they represent compound heterozygotes carrying one copy of the gene 
mutated within the POLG CAG repeat tract, and a second copy which has a 
pathological mutation elsewhere in the gene. The present invention thus 
covers not only the use of genotyping at the CAG repeat, but also the 
complete sequencing of the POLG gene as a diagnostic procedure for the 

25 determination of male infertility associated with POLG gene dysfunction, in 
cases where genotyping at the CAG repeat is insufficient to permit an 
unambiguous molecular diagnosis. Because such cases are expected to 
constitute up to half of all individuals with POLG-associated infertility, based 
on the observed frequencies of homo- and heterozygotes, i.e. given the 

30 excess number of heterozygotes amongst the infertiles, the further analysis 
illustrated in this Example will have important diagnostic value, as regards the 
counselling and treatment to be offered. 

DNA is prepared from the same sources and using the same 
methods as indicated under Example 1 (above). Long-extension PCR is 

35 carried out using a series of primer pairs flanking blocks of exons of the POLG 
gene, as illustrated in Figure 3, creating products that include the entire coding 
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region of the gene, the 5' and 3' untranslated segments of the POLG mRNA, 
the proximal promoter of the gene, and all intron/exon boundaries. 
Oligonucleotide primers are based on the publicly available sequence of the 
POLG gene (deposited in the Genbank database as accession number 
5 AC005317), and sourced from companies such as indicated under Example 1. 
In this specific example, illustrated in Figure 2, the relevant segments of the 
POLG gene are amplified initially as 3 large PCR products, although many 
other strategies are equivalent, based on other combinations of primers 
according to the database sequence. LX-PCR is carried out with a standard 

10 reagent kit (e.g Boehringer Mannheim Extend DNA Polymerase), under 
manufacturer's recommended conditions, or using equivalent materials 
sourced elsewhere. PCR products are analysed by agarose gel 
electrophoresis and purified from agarose gels by standard, spin-column 
methods, e.g. using kits available from Qiagen or similar manufacturers. A 

15 total of 52 sequencing primers are also required (one in each direction for the 
21 short exons and boundaries, two in each direction to cover the longer exon 
23, and three in each direction to cover the longest exon, exon 2, in which the 
CAG repeat lies, presenting additional difficulties. The 20 bp sequences of 
these correspond with intronic sequences located approximately 30 bp up- 

20 and downstream of the various intron-exon boundaries, based on the 
database-deposited gene sequence. In addition, internal exonic sequencing 
primers (one for each strand) are required for 20 bp regions in the middle of 
exon 23 and exon 2, as well as 20 bp on either side of the CAG repeat (one 
strand only, in each case, reading towards the repeat. 

25 Fluorescent DNA sequencing is carried out using a standard dye- 

terminator reagent kit (e.g. PerkinElmer Big Dye), in thermal cycling reactions 
using the long PCR products as template, and each sequencing primer as 
appropriate, under the kit manufacturer's recommended conditions. 
Sequencing reaction products are analyzed by gel or capillary electrophoresis, 

30 on a suitable electrophoresis/fluorescent analysis instrument, e.g. ABI 310 
Genetic Analyzer (PerkinElmer), using the manufacturer's data collection and 
analysis software. 

DNA sequences are carefully examined for evidence of 3 types of 
detectable heterozygosity, which manifest differently on the sequence traces, 

35 as follows, (a) A heterozygous point mutation manifests as a specific ambiguity 
localized to one nucleotide pair, and detectable on both strands in a congruent 
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manner (i.e. the specific ambiguity on one strand is found as the exactly 
complementary ambiguity on the other), (b) A heterozygous deletion or 
insertion of one or more nucleotides in a localized region manifests as a 
break-point between sequence that is readable (unambiguous) and sequence 
5 that is totally unreadable (ambiguous at virtually every position), with the exact 
location and size of the deletion or insertion inferred by the positions at which 
ambiguity commences when read on the two strands, (c) A large heterozygous 
deletion or insertion less than 10 kb in size, with both break-points located 
within the gene, is evident from heterogeneity in the product sizes in long 

10 PCR, with a need to sequence the two alternate products independently to 
infer the nature and location of the re-arrangement. 

A heterozygous deletion or insertion longer than 10 kb, or with only 
one break-point located within the gene does not necessarily manifest in a 
simple manner by such methods, but can be detected by the use of restriction 

15 mapping combined with Southern blotting, or by so-called one-sided PCR 
methods, in which the flanking regions of a given sequence in genomic DNA 
are characterized by restriction digestion, dilution, religation to form circles, 
and PCR using two adjacent, outwardly oriented primers. 

Where evidence of a heteozygous mutation is obtained, some form 

20 of haplotype analysis is still required, to confirm that the CAG repeat-length 
mutation and the additional heterozygous mutation are on different copies of 
the gene. This requires a combination of standard mutation detection 
methods, such allele-specific PCR, or restriction fragment length 
polymorphism applied over a considerable length of genomic DNA, involving, 

25 for example, long-extension PCR or Southern blotting or a combination of 
such techniques. 

The implications of the results from this much more exhaustive test 
are similar to those outlined under example 1. Assignment of a mutation may 
not be straightforward, but POLG mutations that cause changes of reading 
30 frame, that destroy splice sites or create new ones, that generate a stop 
codon, that alter a highly conserved amino acid or that delete or interrupt a 
significant region of coding sequence are unquestionably of pathological 
significance, if combined with CAG repeat-length mutation in the other copy of 
the gene. 

35 Detection of a second, clearly pathological mutation in such an 

individual, outside of the CAG repeat, would confirm that his infertility is of 
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essentially the same kind as of those already discussed under example 1, with 
similar implications for counselling and treatment. 
Example 3 

Application of the invention to population-based screening for 
5 genetic predisposition to infertility 

This example describes the way in which the invention is to be 
applied in the context of genetic screening of the population. It is assumed that 
this application will respect privacy, plus any relevant legislation in force in 
different jurisdictions. Its goal is to enable people to make informed choices 
10 about their reproductive life without prescribing to them any particular course 
of action. The possibility for a male to know, in advance, that he will suffer 
infertility at reproductive age, empowers both him and his partner in a way that 
other kinds of clinical testing do not, given that they are typically applied long 
after the problem has manifested, and usually too late for its consequences to 
15 be completely corrected or managed. This application could be placed in the 
context of a programme of voluntary genetic testing applied in infancy, or at 
any age, being offered as one of a large number of tests for genetic disorders. 
It could also be in the context of a confidential advisory service or even, 
eventually, a self-test kit. The specific example given is in the context of a 
20 voluntary public health programme, focused on the specific question of genetic 
predisposition to infertility, and offered in early adolescence. 

A defined population of young males (e.g. all those reaching their 
17th birthday in a given year in a given locality) are contacted via family 
doctors, educational institutions or a population registry, and asked whether 
25 they wish to participate voluntarily in the screening programme under 
conditions of total confidentiality. The benefits of choosing to do so, plus the 
predictive limitations of the test, are fully explained to all, enabling them to 
make an informed choice about their participation. Buccal smears (mouthwash 
samples) are collected from those agreeing to take the test, and DNA is 
30 extracted using similar methods as outlined under example 1. POLG 
genotyping is carried out using the same techniques as outlined in examples 1 
and 2 above. The results of the test are communicated to all volunteers on a 
confidential basis, with more specific counselling offered to anyone who 
requests it. 

35 The test results and their implications are clear: those found to be 

homozygous for loss of the wild-type POLG repeat-length allele, or to be 



compound heterozygotes with one copy of the gene mutated at the CAG 
repeat and one copy carrying a clearly pathological mutation elsewhere in the 
gene, are advised that prior genetic surveys indicate that they will suffer from a 
fertility problem, and that if they wish to have children they should consult an 
5 infertility clinic at the appropriate time, to arrange for assisted reproduction as 
already described under example 1. Those found to carry at least one wild- 
type copy of the gene are advised that one common, genetic cause of male 
infertility has been excluded, but that this does not necessarily mean that they 
will be free of fertility problems, since there are other genetic and 

10 environmental causes that account for a large fraction of fertility problems. 
Hence they should be aware of the relevant services available in their locality, 

y and advised to seek specific advice if they become aware of a fertility problem 
in the future. 

Obviously, the value of the test described in this example will be greatly 
15 enhanced, if combined in the future with tests for other genetic predispositions 
to male infertility, as these become apparent. This will strengthen the 
predictive value of the test, although even those in whom all genetic causes of 
male infertility have been excluded must be made aware that their partner may 
also suffer fertility problems independently, and that genetics cannot explain 
20 everything. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Ancobio Ltd. 

(B) STREET: 375 West Regent St 

(C) CITY: Glasgow 

(E) COUNTRY: Scotland, UK 

(F) POSTAL CODE (ZIP): G2 4LH 

(ii) TITLE OF INVENTION: Diagnostic and screening method 

(iii) NUMBER OF SEQUENCES: 5 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4440 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double (only one strand shown) 

(D) TOPOLOGY: linear 



(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GCGGACCGGC CGGGTGGAGG CCACACGCTA CCCCGAGGCT GCGTAGGCCG CGCGAAGGGG 
GACGCCGTGC CGTGGGCCTG GGGTCGGGGG AGCAGCAGAC CGGGAAGCAC CGTGAGGACC 
GAGGATTTGG GGTGGAAGGC AGGCATGGTC AAACCCATTT CACTGACAGG AGAGCAGAGA 



60 
120 
180 
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L AbbAL b X b X 




f^TP mp PPPAfiP 


TAGTAAAAKA 


AGCCAAGCTG 


GAGCC CAAAG 


240 


t^Abb ivji iL 


TPAPTPPPAP. 




C C TGCAC CAA 


CCATGAGCCG 


CCTGCTCTGG 


300 


AbbrtAbb X bb 


PPGGPGCPAP 


PGTPfZfiGPCA 


GGGCCGGTTC 


CAGCTCCGGG 


GC GCT GGGT C 


360 


X V I rVljU X V 


m P P P P PP PT P 


PfZAPPPPAfSP 


GACGGGCAGC 


GGCGGCGGCA 


GCAGCAGCAG 


420 


UAb LA b LA bL 


A P.P A PP A f3P A 


APAf^PAfZPPT 


PAGPiAGCPGP 


AAGT GC TAT C 


CTCGGAGGGC 


480 


bbb< — AbL J- bL. 


PPP APA APPP 


A T"T(^P A P A TP 


PAGATGPTPT 


C GAGAGGGC T 


GCAC PA PP. A A 

vjv*i*v> wi\jL^nn 


540 


AIL 1 X LbbbL 


a M^apc/ya 

A^bbAbbbbA 


r* ta t Pr^ r* t cine 




T GP GP P GPJVG 


P GT P GAGPAP 


600 


L X b LAbAAbL 




fi/^/lPHVPl 
bbbbLnbLLA 


fipppi r rfippp r r 

bLLvr X bvvL X 


T GP P P G A P GT 


GGAGP TCP GP 


660 


L X bLLbLLLL 


*T"P m R P ClflflT* a 


LiviLL X wAL 


PAP^APTTPP 


crrTrprnnr 


rPAGAAGTAG 


720 


AGCCTGCCCT 


r pptpp a r*nr* 
ALL X bbAbbL 


bbLLAAL 1 X Lt 




LLLauAv X bV^V^ 


PPPGAAGPPP 




CCGGCTTGGG 


C C TbbbL bbA 


bGGLTbbALL 


LbbiALbbLL 


P P P A (ZflfZC P A 


PPP P (?T A P P P 




GTGGCCATCC 


LLbAbbAbCb 


0 0 r* fn ^* tt\ r* 

bGL C C- T Gvj X 


TTIPPIPPTCP 
X X LbALb X Izrij 


A CPTPTCPTT 
Abb X L X bL X X 


bbLAbAbbbn 




ACTTGCCCCA 


CATTGGLGGT 


GCiCuATAX LL 


LLL X LbbLLl 


fin t 1 & m m P p r r f2 

bbXAX X LL X b 


PT*PP A CP PIC 
b X bLAbLLAb 


q ah 


CGGCTGGTGG 


AAGAGCGT1A 


C-x L X TbbALL 


AbLLAbL X \s X 


P PP P P/2P TfZ A 
\_bLLbbL X brt. 


X LAI ^V^tL^V^i 


1020 


CTGGAGGTCC 


CTACTGGTGC 


UA(oLlAL>L LLL 


7ir^pp7ir'APAP 


AL X bbLAbbA 


(ZC A GTT A P.TC 
bL-AbX lAbl b 


1 nan 

j. \j 0 \j 


GTGGGGCAiJA 


ATGTTTCCTT 


m 7\ ppp t\ rip m 
X bALLbAbL X 


P A *T A *P P A C/Z^Z 


A ftlP A C m A P P T 


PATPPAPPPT 


1140 


TCCCGCATGU 


bx. X X LL X bbA 


r"" 1 RPPXfPPSPP 




PPATPTPAGG 


GPTAAGPAGP 


1200 


T X LLV\LH_.bLA 


blLlbl bbAX 


icriicppa An 


PAfZfZ/ZPAAAP 


APAAGGTPP-A 


PrPPPPPPAPA 


1260 


AA bL>VAbbL L 


a p a a PTppr a 

AbAAbX uLLA 


P APP A A Af^PP 


APA&fZAGGPP 


CAGCGATCTC 


AT CP.T GGGAC 


1320 


T CtGC TbbALA 


X UftGLAb X bX 


PllPlPTPTC 
1. tin 1 -Mi-i lLlb 


fZP A fZ A flfZ m niP 


APAGAfTTTA 


T GT AGGGGGG 


1380 


CCTCCLITAb 


T> A 7i P P 7i pp p 
AbAAbbAbLL 


TPpafiA tPT/i 


XXX la X V7fxn.urW 


GPAPPATGAA 


GGACAT TCGT 


1440 


GAGAACTTCC 


7S PPJipprnpiirn 
AbbALL X bAl 


r* r™< a p *r A P T" p 


npppAnnAPfi 

bLLwibwi'-b 


TGTGGGPPAP 


PPATGAGGTT 


1500 


TTCCACjCAbL 


AbL lALLbL X 


pttpttcpip 


a pp TPTPrrr 

Abb X blL^LL 


APPPAGTGAP 


TPTGGPCGGP 


1560 


ATGCTGGAbA 


x bbGXblLiL 


PTRPPT/IPPT 






TTArfTGCTA 


1620 


GAGGCACAGG 


bCALl X AX tiA 


rrivPPTPPiP 

LyLt/\tjL. X H_-/-iL» 


PPPPAP ATP A 
Lbub/ibAl bn 


AbAAbi Lb X X 


GATGGATCTG 


1680 


GC CAATGATG 


CCTGCCAbCT 




PaPHP-PTlPJi 
bAbAbb X ALA 


A A PA AfZAPPP 
AAbAAbAU v— V.. 


l_. X bbL X X bb 


1740 


GACCTGGAGT 


GGGACCTGCA 


AbAAl X lAAb 


LAbAAbAAAb 


rTaiiPaappT 

L X AAbAAbb X 


P A APA aPPA A 
bAAbAAbbAA 


1800 


C CAGC CACAG 


LLAbLAAbi X 


GC v_ LA X L tiAG 


bbbbL I Lsuob 


LLLL X J. b>i 


TPPP ATPPAT 
X LLL".! ijVj^-iX 


1860 


CAGGAAGACC 


TCGGCCCCTG 


CAGT GAGGAG 


GAGGAGTTTC 


AACAAGATGT 


CATGGCCCGC 


1920 


GCCTGCTTGC 


AGAAGCTGAA 


GGGGACCACA 


GAGCTCCTGC 


CCAAGCGGCC 


CCAGCACCTT 


1980 


CCTGGACACC 


CTGGATGGTA 


CCGGAAGCTC 


TGCCCCCGGC 


TAGACGACCC 


TGCATGGACC 


2040 


CCGGGCCCCA 


GCCTCCTCAG 


CCTGCAGATG 


CGGGTCACAC 


CTAAACTCAT 


GGCACTTACC 


2100 


TGGGATGGCT 


TCCCTCTGCA 


CT AC T CAGAG 


CGTCATGGCT 


GGGGCTACTT 


GGTGCCTGGG 


2160 


CGGCGGGACA 


ACCTGGCCAA 


GCTGCCGACA 


GGTACCACCC 


TGGAGTCAGC 


TGGGGTGGTC 


2220 


TGCCCCTACA 


GAGC CAT C GA 


GTCCCTGTAC 


AGGAAGCACT 


GTCTCGAACA 


GGGGAAGCAG 


2280 


CAGCTGATGC 


CCCAGGAGGC 


CGGCCTGGCG 


GAGGAGTTCC 


TGCTCACTGA 


CAATAGTGCC 


2340 
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ATATGGCAAA CGGTAGAAGA ACTGGATTAC TTAGAAGTGG AGGCT GAGGC CAAGATGGAG 24 00 

AACTTGCGAG CTGCAGTGCC AGGTCAACCC CTAGCTCTGA CTGCCCGTGG TGGCCCCAAG 2 4 60 

GACACCCAGC CCAGCTATCA CCATGGCAAT GGACCTTACA ACGACGTGGA CATCCCTGGC 2520 

TGCTGGTTTT TCAAGCTGCC TCACAAGGAT GGTAATAGCT GTAATGTGGG AAGCCCCTTT 2 580 

GCCAAGGACT TCCTGCCCAA GATGGAGGAT GGCACCCTGC AGGCTGGCCC AGGAGGTGCC 2 64 0 

AGTGGGCCCC GTGCTCTGGA AATCAACAAA ATGATTTCTT TCTGGAGGAA CGCCCATAAA 2700 

CGTATCAGCT CCCAGATGGT GGTGTGGCTG CCCAGGTCAG CTCTGCCCCG TGCTGTGATC 27 60 

AGGCACCCCG ACTATGATGA GGAAGGCCTC TATGGGGCCA TCCTGCCCCA AGTGGTGACT 2820 

GCCGGCACCA TCACTCGCCG GGCTGTGGAG CC CACAT GGC TCACCGCCAG CAATGCCCGG 2880 

CCTGACCGAG TAGGCAGTGA GTTGAAAGCC ATGGTGCAGG CCCCACCTGG CTACACCCTT 2940 

GTGGGTGCTG ATGTGGACTC CCAAGAGCTG TGGATTGCAG CTGTGCTTGG AGACGCCCAC 3000 

TTTGCCGGCA TGCATGGCTG CACAGCCTTT GGGTGGATGA CACTGCAGGG CAGGAAGAGC 3060 

AGGGGCACTG ATCTACACAG TAAGACAGCC ACTACTGTGG GCATCAGCCG TGAGCAT GCC 3120 

AAAATCTTCA ACTACGGCCG CAT CTATGGT GCTGGGCAGC CCTTTGCTGA GC GC T TACT A 3180 

ATGCAGTTTA ACCACCGGCT CACACAGCAG GAGGCAGCTG AGAAGGCCCA GCAGAT GT AC 3240 

GCTGCCACCA AGGGCCTCCG CTGGTATCGG CTGTCGGATG AGGGCGAGTG GCTGGTGAGG 3300 

GAGTTGAACC TCCCAGTGGA CAGGACT GAG GGTGGCTGGA TTTCCCTGCA GGATCTGCGC 3360 

AAGGTC CAGA GAGAAACTGC AAGGAAGTCA CAGTGGAAGA AG T GGGAGG T GGTTGCTGAA 3420 

CGGGCATGGA AGGGGGGCAC AGAGT CAGAA ATGTTCAATA AGCTTGAGAG CATTGCTACG 3480 

TCTGACATAC CACGTACCCC GGTGCTGGGC TGCTGCATCA GCCGAGCCCT GGAGCCCTCG 3540 

GCTGTCCAGG AAGAGTTTAT GACCAGCCGT GTGAATTGGG TGGTACAGAG CTCTGCTGTT 3600 

GACTACTTAC ACCTCATGCT TGTGGCCATG AAGTGGCTGT TTGAAGAGTT TGC CAT AGAT 3660 

GGGCGCTTCT GCAT CAGCAT C CAT GAC GAG GTTCGCTACC TGGTGCGGGA GGAGGACCGC 3720 

TACCGCGCTG CCCTGGCCTT GCAGAT CAC C AACCTCTTGA CCAGGTGCAT GTTTGCCTAC 3780 

AAGCTGGGTC TGAATGACTT GCCCCAGTCA GTCGCCTTTT T CAGTGCAGT CGATATTGAC 3840 

CGGTGCCTCA 6GAAGGAAGT GACCATGGAT TGTAAAACC C CTTCCAACCC AAC T GGGATG 3900 

GAAAGGAGAT ACGGGATTCC C CAGGGT GAA GC GC TGGATA TTTACCAGAT AATTGAACTC 3960 

ACCAAAGGCT CCTTGGAAAA ACGAAGCCAG CCTGGACCAT AGCACTGCCT GGAGGC TC TG 4 020 

TATTTGCTCC C GT GGAGCTT CAT C GGGGT G GTGCAGGCTC CCAAACTCAG GCTTTCAGCT 4080 

GTGCTTTTTG CAAAAGGGCT TGCCTAAGGC CAGCCATTTT TCAGTAGCAG GACCTGCCAA 4140 

GAAGATTCCT TCTAACTGAA GGTGCAGTTG AATTCAGTGG GTTCAGAACC AAGATGCCAA 4200 

CATCGGTGTG GACTACAGGA CAAGGGGCAT TGTTGCTTGT TGGGTAAAAA TGAAGCAGAA 4260 

GCCCCAAAGT T CACATT AAC TCAGGCATTT CAT TT ATT T T TTCCTTTTCT TCTTGGCTGG 4320 

TTCTTTGTTC TGTCCCCCAT GCTCTGATGC AGTGCCCTAG AAGGGGAAAG AATTAATGCT 4380 

C T AAC GT GAT AAACCTGCTC CAAGGCAGTG GAAATAAAAA GAAGGAAAAA AAAGAAAAAA 4440 
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(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
5' CCAGCTCCGTCCCCGCGTCCGACC 3' 



(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



5' GCTGCCCGCCCTCCGAGGATAGCAC 3' 



(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
5'CTCTCGAGAGCATCTGGATGTCCAATC 3' 



(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
5'CTCGTGCAGCCCTCTCGAGAGCAT 3' 



25 



Claims 

1. A method for the diagnosis of male infertility, characterized by 
detecting the presence or absence of a mutation or mutations in the POLG 
gene encoding the catalytic subunit of mitochondrial DNA polymerase in a 

5 biological sample. 

2. A method for population-based screening for genetic pre- 
disposition to male infertility characterized by detecting the presence or 
absence of a mutation or mutations in the POLG gene encoding the catalytic 
subunit of mitochondrial DNA polymerase in a biological sample. 

10 3. A method of claim 1 or 2, characterized in that the mutation or 

v mutations are located in the trinucleotide (CAG) microsatellite repeat of the 
POLG gene. 

4. A method of claim 3, characterized in that the mutation or 
mutations are located in both alleles of the POLG gene in the trinucleotide 

1 5 (CAG) microsatellite repeat of the POLG gene. 

5. A method of claim 1 or 2, characterized in that the mutation or 
mutations are located in or near a coding region of the POLG gene. 

6. A method of claim 1 or 2, characterized in that one mutation or 
mutations are located in one allele of the POLG gene in the trinucleotide 

20 (CAG) microsatellite repeat and another mutation or other mutations in the 
other allele of the mutant POLG gene in or near a coding region of the gene. 

7. A method of any one of claims 1 to 6, characterized in that the 
detection of the mutation is performed using a gene technological method. 

8. A method of claim 7, characterized in that the detection of the 
25 mutation or mutations is performed using the polymerase chain reaction (PCR) 

or other thermal cycler-based DNA synthetic techniques, molecular cloning in 
a plasmid or other suitable vector, detection of length variants in a DNA 
sample by agarose or polyacrylamide gel electrophoresis, gel or capillary 
electrophoresis and analysis of products tagged with a fluorescent or other 
30 label incorporated into the DNA, DNA sequence determination and any 
heteroduplex-based or similar methods for detecting base mismatches or 
length variants. 

9. A method of any one of claims 1 to 6, characterized in that the 
detection of the mutation or mutations is performed using an immunological 

35 method, such as a Western analysis, immunohistology or immunoassay, for 
characterization of a mutant gene or gene product. 
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10. A method of claim 9, characterized in that the detection of 
mutation or mutations is performed using immunohistology. 

1 1 . A use of a mutant form of the POLG gene encoding the catalytic 
subunit of mitochondrial DNA polymerase for the diagnosis or prediction of 

5 male infertility. 

1 2. A use of a mutant form of the POLG gene encoding the catalytic 
subunit of mitochondrial DNA polymerase as a diagnostic agent. 

13. A diagnostic kit, characterized in that it comprises reagents 
capable of identifying the presence of absence of a mutation of mutations in 

10 the POLG gene encoding the catalytic subunit of mitochondrial DNA 
polymerase. 

^ 14. A use of the POLG gene as an indicator of other pathological 

conditions associated with or related to male infertility, including those 
manifesting in women. 




(57) Abstract 

The present invention relates to a novel diagnostic 
method for the detection of infertility in males. In 
particular, the present invention relates to a diagnostic 
method for detecting the presence or absence of a 
mutation or mutations in the POLG gene encoding 
mitochondrial DNA polymerase in a biological sample. The 
invention relates also to the use of a mutant POLG gene 
in the detection of infertility in males and in the screening 
of human populations for the presence of such mutation or 
mutations as a predictive test for male infertility. The 
invention relates also the use of the POLG gene as an 
indicator of other pathological conditions associated with 
or related to male infertility, including those manifesting in 
women. 



